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Abstract —In this paper Raptor code ensembles with linear 
random precodes in a flxed-rate setting are considered. An 
expression for the average distance spectrum is derived and 
this expression is used to obtain the asymptotic exponent of the 
weight distribution. The asymptotic growth rate analysis is then 
exploited to develop a necessary and sufficient condition under 
which the fixed-rate Raptor code ensemble exhibits a strictly 
positive typical minimum distance. 


I. Introduction 

Fountain codes 111 are erasure codes potentially able to 
generate an endless amount of encoded symbols. As such, 
they find application in contexts where the channel erasure 
rate is not a priori known. The first class of practical fountain 
codes, Luby Transform (LT) codes, was introduced in 
together with an iterative belief propagation (BP) decoding 
algorithm that is efficient when the number of input symbols k 
is large. One of the shortcomings of LT codes is that in order to 
have a low probability of unsuccessful decoding, the encoding 
cost per output symbol has to be O (In(fc)). Raptor codes |3l 
overcome this problem. They consist of a serial concatenation 
of an outer precode C with an inner LT code. The LT code 
design can thus be relaxed requiring only the recovery of a 
fraction 1 — 7 of the input symbols with 7 small. This can be 
achieved with linear encoding complexity. The outer precode 
is responsible for recovering the remaining fraction of input 
symbols, 7. If the precode C is linear-time encodable, then 
the Raptor code has a linear encoding complexity, O (k), 
and, therefore, the overall encoding cost per output symbol 
is constant with respect to k. Furthermore, Raptor codes are 
universally capacity-achieving on the binary erasure channel. 

Most of the works on LT and Raptor codes consider BP 
decoding which has a good performance for very large input 
blocks {k at least in the order of a few tens of thousands 
symbols). Often, in practice smaller blocks are used. For 
example, for the Raptor codes standardized in a and ID 
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the recommended values of k range from 1024 to 8192. For 
these input block lengths, the performance under BP decoding 
degrades considerably. In this context, an efficient maximum 
likelihood (ML) decoding algorithm in the form of inactivation 
decoding m may be used in place of BP. Recently, ML 
decoding for Raptor and LT codes has been analyzed Q-ll, 
focusing however mainly on their decoding complexity under 
inactivation decoding. 

Despite their rateless capability. Raptor codes represent 
an excellent solution for fixed-rate communication schemes 
requiring powerful erasure correction capabilities with low 
decoding complexity. Hence, it is not surprising that Raptor 
codes are actually used in a fixed-rate setting by existing 
communication systems (see e.g. ifTOl ). In this context, the 
performance under ML erasure decoding is determined by the 
distance properties of the fixed-rate Raptor code ensemble, 
that to the best knowledge of the authors have not yet been 
analyzed. 

In this paper we analyze the distance properties of fixed- 
rate Raptor codes. In particular, we focus on the case where 
the precode is picked from the linear random ensemble. The 
choice of this ensemble is not arbitrary. The precode used by 
some standardized Raptor codes IH, Q is a concatenation 
of two systematic codes, the first being a high-rate regular 
low density parity check (LDPC) code and the second being 
pseudo-random code characterized a dense parity check ma¬ 
trix. These precodes were designed to behave like codes of 
the linear random ensemble in terms of rank properties, but 
allowing a fast algorithm for matrix vector multiplication CD. 
Thus, we conjecture that the results obtained for the ensemble 
considered in this work may give (as a first approximation) 
hints on the distance properties of Raptor codes employed in 
existing systems. For this Raptor code ensemble we develop 
a necessary and sufficient condition to guarantee a strictly 
positive typical minimum distance. The condition is found 
to depend on the degree distribution of the inner LT code 
and on the code rates of both the inner LT code and the 
(outer) precode. A necessary condition is also derived which, 
beyond the inner/outer code rates, depends on the average 
output degree only. 

The rest of the paper is organized as follows. The main 
definitions are introduced in Section [B] Section [HI] provides 


k source 


h intermediate 


n output 

symbols 

{h, k) outer 
code 

symbols 

(n, h) fixed-rate 
LT code 

symbols 




u 


V 


X 


where A° is the average WE of the outer precode, and A[ ^ 
is the average input output weight enumerator (lOWE) of the 
inner LT code. The average WE of an {h, k) random code is 
known to be (see IH) 


Fig. 1. Raptor codes consist of a serial concatenation of a linear block code 
(pre-code) with a LT code. 

the derivation of the average weight distribution of this Raptor 
code ensemble and of the associated growth rate. Section HVl 
provides necessary and sufficient conditions for a positive typ¬ 
ical minimum distance. The conclusions follow in Section IV] 

II. Preliminaries 


(3) 

We now focus on the average lOWE of the LT code. Let 
us denote by I the Hamming weight of the input of the LT 
encoder, and let us assume that the output symbol of the LT 
code has degree j. Let us denote by pjj the probability that 
any of the n output bits of the LT encoder takes the value 1 
given that the Hamming weight of the intermediate word is I 
and the degree of the LT code output symbol is j, i.e.. 


We consider fixed-rate Raptor code ensembles based on the 
encoder structure depicted in Ligure [T] The encoder is given 
by a serial concatenation of an (/i, k) outer precode with an 
(n, h) inner fixed-rate LT code. We denote by u the outer 
encoder input, and by U the corresponding random vector. 
Similarly, v and x denote the input and the output of the 
fixed-rate LT encoder, with V and X being the corresponding 
random vectors. The vectors u, v and x are composed by k, 
h and n symbols each. The symbols of u are referred to as 
source symbols, whereas the symbols of v and x are referred 
to as intermediate and output symbols, respectively. 

We restrict to symbols belonging to F 2 . We denote by iuh (a) 
the Hamming weight of a binary vector a. Lor a generic LT 
code output symbol Xi, deg(a:i) denotes the output symbol 
degree, i.e., the number of intermediate symbols that are added 
(in F 2 ) to produce Xi. We will respectively denote by Tq = 
k/h, r\ = h/n, and r = k/n = Ton the rates of the outer 
code, the inner LT code, and the Raptor code. 

We consider the ensemble of Raptor codes 
^ 2 , r;, To, n) obtained by a serial concatenation of 
an outer code in the (rin, ronn) binary linear random block 
code ensemble, with all possible realizations of an 
{n,r\n) fixed-rate LT code with output degree distribution 
H = {Hi, O 2 , 03 ,..., where corresponds to the 

probability of having an output symbol of degree i. We finally 
denote as H the average output degree, H = 

In the following we make use of the notion of exponential 
equivalence m, as follows. Two real-valued positive se¬ 
quences a(n) and b{n) are said to be exponentially equivalent, 
writing a(n) = b{n), when 


lim — logo 

n—¥oo Ti 


a(n) 

b(n) 


= 0 . 


( 1 ) 


Moreover, given two pairs of reals {xi,yi) and {x 2 ,y 2 ), we 
write (xi,t/i) ^ {x 2 ,y 2 ) when xi > X 2 and yi > j/ 2 - 


Pj,i ■■= PrjXi 
for any i G { 1 ,..., n} 


= l|u;H(V) = Z,deg(Xi)=j} 

. This probability may be expressed as 


min(Li) 

Pxi = ^ 

i odd 


(t) 


( 4 ) 


Removing the conditioning on j we obtain pi, the probability 
of any of the n output bits of the LT encoder taking value 1 
given a Hamming weight I for the intermediate word, i.e.. 


PI :=Pr{X, = 1|u;h(V)=Z} 
for any i G {1,..., n}. We have 

^max 

Pi ^jPiA 

Since the output bits are generated by the LT encoder indepen¬ 
dently of each other, the Hamming weight of the LT codeword 
conditioned to an intermediate word of weight I is a binomially 
distributed random variable with parameters n and pi. Hence, 
we have 

Pr{u;H(X) = u;|u;h(V) = 1 } = (5) 

The average lOWE of a LT code may be now easily calculated 
multiplying (|5]l by the number of weight-Z intermediate words, 
yielding 

Making use of Q and (| 6 ]l, for w > 0 the average WE of 
the fixed-rate Raptor code ensemble can be expressed as 


IH. Distance Spectrum oe Lixed-Rate Raptor Code 
Ensembles 

A. Average Weight Enumerator 

Let us denote by the average weight enumerator (WE) 
of the ensemble ^,r\,To,n). Lor w > 0 we have 


Aqi, - 
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( 2 ) 


B. Growth Rate of Fixed-Rate Raptor Code Ensembles 
In this subsection we compute the asymptotic exponent 
(growth rate) of the weight distribution for the ensemble 
'^oo ('^0 5 ^2, Tj, t-q), that is the ensemble ^2, r;, ro, n) in 

the limit where n tends to infinity for constant r\ and Tq. 
Hereafter, we denote the normalized output weight of the 
Raptor encoder hy w = w/n and the normalized output weight 
of the precoder (input weight to the LT encoder) hy I = l/h. 











Fig. 2. Growth rate vs. normalized output weight w. The solid line shows 
the growth rate of a linear random code with rate r = 0.99. The dot- 
dashed, dashed, and dotted lines show the growth rates G{w) of the ensemble 
^ 00 (^ 0 , = 0.99) for r\ = 0.95, 0.88 and 0.8, respectively. 



^min 


Fig. 3. Overall rate r vs. the typical minimum distance dmin- The solid line 
represents the asymptotic Gilbert-Varshamov bound. The different dashed and 
dotted lines represent Raptor codes ensembles =r/ro,ro) 

with different outer code rates, Tq. 


Using the well-known exponential equivalence 
{tvw) ^ where Ht, is the binary entropy function, 

for large n the multiplicative term in front of the summation 
in O fulfills 

Therefore, = A^n in O fulfills 


The asymptotic exponent of the fixed-rate Raptor code ensem¬ 
ble weight distribution is finally 

G{w) := lim - loga 

n—¥oo ji 

= Hb(tZ’) - n(l - To) +fmax(w)- 
Moreover, the real number 

Jtnin := inf{w > 0 : G{w) > 0} 
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Using the result 

=max2”^(“) 

a 

a 

we can simplify the expression of A^n as 


is the typical minimum distance of the ensemble. 

Fig. |2] shows G{w) for the ensemble fi,?’o). 

where is the output degree distribution used in the 

standards i), fa (see details in Table |I| and Tq = 0.99 for 
three different r; values. It can be observed how the curve for 
r; = 0.95 does not cross the x-axis, the curve for r-, = 0.88 
has dmin = 0 and the curve for r, = 0.8 has dmin = 0.0005. 
The figure also shows the growth rate of the precode, a linear 
random code with r = 0.99. It can be observed how the typical 
minimum distance of the precode is larger than that of the 
concatenated (Raptor) code. 

Fig. [3 shows the overall rate r of the Raptor code ensemble 
= r/ro,ro) versus the typical minimum 
distance d^in. It can be observed how, for constant overall 
rate r, dmin increases as the outer code rate decreases. It 
also can be observed how decreasing allows to get closer 
to the asymptotic Gilbert-Varshamov bound. 


A,-, 
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IV. Rate Regions 


where 

fmax('i^) := max f(?!;,[) 
i 

and 

fiwj) := riHb(r) +w\og2Pi + (1 - lu) loga - Pi) ■ 


We are now interested in determining whether the ensemble 
exhibits good typical distance properties. More specifically, 
we are interested in the existence of a strictly positive typical 
minimum distance. A sufficient condition for having a positive 
typical minimum distance is 

lim G(w) < 0 
10 ^ 0 + 

























which implies 


ri(l-ro)> lim ftnax(w). ( 8 ) 

u)—>^ 0 + 

Unfortunately, a closed-form expression for the right-hand side 
of (l 8 ]l does not exist in general. However, the order of limit 
and the maximization can be inverted by observing that the 
function f^ax is right-continuous at rh = 0 , that is 


lim fmax(rh) — fmax(O)- 
u)—v 0 + 

It is now possible to recast the right-hand side of (| 8 ]l as 
lim fmax(r^) = lim maxf(i(;, Z) = max lim f{w^l) 

w^0+ w^0+ I I -li;—>- 0 + 


= max 
i 


riHb(0 -f log2 (l-Pi) 

= ■ fmax(n) 


(9) 


where we emphasized that f^ax hides a dependency on r-,. 
Computing (|9]l implies candying out a maximization which 
cannot generally be computed analytically. However, the func¬ 
tion to be maximized is sufficient well behaved so that the 
maximization can be done numerically in an efficient manner. 

Definition 1 (Positive typical minimum distance region). We 
define the positive typical minimum distance region of a Raptor 
code ensemble as the set of code rate pairs (ri,ro) for 
which the ensemble possesses a positive typical minimum 
distance. Formally : 


TABLE I 

Degree distributions OW , defined in AND , DEEINED IN 


Degree 

nd) 

D(2) 

1 

0.0098 

0.0048 

2 

0.4590 

0.4965 

3 

0.2110 

0.1669 

4 

0.1134 

0.0734 

5 


0.0822 

8 


0.0575 

9 


0.0360 

10 

0.1113 


11 

0.0799 


18 


0.0012 

19 


0.0543 

40 

0.0156 


65 


0.0182 

66 


0.0091 

n 

4.6314 

5.825 


be computed more easily, and only depends on the average 
output degree. 


Proposition 1. The positive typical minimum distance region 
region of a fixed-rate Raptor code ensemble n, ro) 

fulfills f- G, where 

G := |(ri,ro) ^ (0,0)|ri < min | (10) 


^ ■■= {(n,ro) ^ (0,0)|fi™„(H,ri,ro) > O} , 

Where we have used the notation dmin = d„,i„{Tl,r\,ro) to 
emphasize the dependence on H, r\ and Tq. 

Theorem 1. An inner positive typical minimum distance 
region, , is given by 

■■= {(n,ro) ^ (0,0)|ri(l - To) > f*ax(n)}. 

Proof: It follows from ([ 8 ]) being a sufficient condition for 
having a positive typical minimum distance. ■ 

Theorem 2. The inner positive typical minimum distance 
region and the positive typical minimum distance TP region 
coincide, P = TP. 

Proof: Due to space constraints we provide only a sketch 
of the proof. The argument is based on the observation that any 
pair (rijTo) such that ri(l — Tq) < lim^i,_>,o+fmax(?h) cannot 
belong to TP since for these pairs lim^ 5 ^Q+ G{w) > 0. An 
analysis must then be carried out for those (ri,ro) pairs such 

thatri(l-ro) = limjb^o+ fmax(w), meaning lim^5^o+ G{w) = 

0. For these (ri,ro) pairs, the only possibility for having a 
positive typical minimum distance is 

lim G'(w) < 0. 

-lij—)- 0 + 

The proof is completed by showing that the above condition 
never holds, regardless of lim^_>o+ G{w) (that is regardless 
the considered {r\,ro) pair). ■ 

Although Theorems [T] and |2] fully characterize the positive 
typical minimum distance P, they require the calculation of 
In the following we introduce an outer region that can 


with 

n log 2 To 

YVo) ~ 11 /I \ I'l \ ■ 

Hb(l - To) - (1 - To) 

Proof: The proof goes by lower bounding for 

zZi —!► 0'*'. Observing (|2|l we can see how is summation 
of the number of Hamming weight wn codewords generated 
by all possible input weights to the LT encoder. A lower 
bound to is the number of Hamming weight wn code¬ 
words generated only by inputs to the LT encoder of weight 
I = 1 — To. Manipulating the expression obtained and making 
use of the exponential equivalency introduced in Section HJ the 
expression in (doll is obtained. ■ 

It is important to point out that the asymptotic exponent 
of the weight distribution captures linear-sized codewords 
M- Codewords whose weight grows with a sublinear weight 
should be subject to an ad-hoc analysis. 

We now consider two different output degree distributions 
given in Table. |I] The first one is the output degree distribution 
used in the standards ii, 0 , which we will refer to as 
Then, we consider a distribution which was designed in 
El for fc = 120000 . 

In Fig. |4] we show the positive typical minimum distance 
region, TP for and together with their outer bound to 
the positive growth region G. It can be observed how the outer 
bound is tight in both cases except for inner codes rates close 
to Ti = 1. The figure also shows several isorate curves, along 
which the overall rate of the Raptor code r stays constant. For 
example, in order to have a positive typical minimum distance 
and an overall rate r = 0.95, the figure shows that the rate of 
the precode must lay below Tq < 0.978 for both distributions. 


























Fig. 4. Positive growth rate region. The solid and dotted lines represent the 
positive growth-rate region of and and the dashed line represents 
its outer bound. The markers represent the rate point at which codes in the 
standards n. m operate for different values of k. 

It is quite remarkable that for precode rates below < 0.978 
there exist a region in which exhibits a positive typical 
minimum distance and does not, although the average 
output degree of is considerably lower than that of 
This exemplifies the fact that the distance properties of a 
Raptor code ensemble depend not only on r\ and Tq and Ct 
but also on the degree distribution fl. 

V. Conclusions 

In this work we have considered ensembles of hxed-rate 
Raptor codes which use linear random codes as precodes. We 
have derived the expression of the average weight enumerator 
of an ensemble and the expression of the growth rate of the 
weight enumerator as functions of the rate of the outer code 
and the rate and degree distribution of the inner LT code. 
Based on these expressions we are able to determine necessary 
and sufficient conditions to have Raptor code ensembles with 
a positive typical minimum distance. A simple necessary 


condition has been developed too, which requires (besides 
the inner and outer code rates) the knowledge of the average 
output degree only. Despite the fact that only binary Raptor 
codes have been considered, an extension to higher order helds 
is possible with a limited effort. 

The work presented in this paper helps to understand the 
behavior of fixed-rate Raptor codes under ML decoding and 
it can be used to design Raptor codes with good distance 
properties, for example, using numerical optimization. 
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